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Abstract — In this paper we propose a new construction method 
for rate-1 Fast-Group-Decodable (FGD) Space-Time-Block Codes 
(STBC)s for 2" transmit antennas. We focus on the case of a = 2 
and we show that the new FGD rate-1 code has the lowest worst- 
case decoding complexity among existing comparable STBCs. 
The coding gain of the new rate-1 code is then optimized through 
constellation stretching and proved to be constant irrespective of 
the underlying QAM constellation prior to normalization. In a 
second step, we propose a new rate-2 STBC that multiplexes 
two of our rate-1 codes by the means of a unitary matrix. 
A compromise between rate and complexity is then obtained 
through puncturing our rate-2 code giving rise to a new rate- 
3/2 code. The proposed codes are compared to existing codes in 
the literature and simulation results show that our rate-3/2 code 
has a lower average decoding complexity while our rate-2 code 
maintains its lower average decoding complexity in the low SNR 
region at the expense of a small performance loss. 

Index Terms — Space-time block codes, low-complexity decod- 
able codes, conditional detection, nonvanishing determinants. 

I. Introduction 

The need for low-complexity decodable STBCs is inevitable 
in the case of high-rate communications over MIMO systems 
employing a number of transmit antennas higher than two. The 
decoding complexity may be evaluated by different measures, 
namely the worst-case decoding complexity measure and the 
average decoding complexity measure. The worst-case decod- 
ing complexity is defined as the minimum number of times 
an exhaustive search decoder has to compute the Maximum 
Likelihood (ML) metric to optimally estimate the transmitted 
symbols codeword lU, or equivalently the number of leaf 
nodes in a search tree if a sphere decoder is employed, whereas 
the average decoding complexity measure may be numerically 
evaluated as the average number of visited nodes by a sphere 
decoder in order to optimally estimate the transmitted symbols 
codeword f3\. Arguably, the first proposed low-complexity 
rate-1 code for the case of four transmit antennas is the 
Quasi-Orthogonal (QO)STBC originally proposed by H. Ja- 
farkhani |4| and later optimized through constellation rotation 
to provide full diversity |5|, |6|. The QOSTBC partially 
relaxes the orthogonality conditions by allowing two complex 
symbols to be jointly detected. Subsequently, rate-1, full- 
diversity QOSTBCs were proposed for an arbitrary number 
of transmit antennas that subsume the original QOSTBC as 
a special case |7|. In this general framework, the quasi- 
orthgonality stands for decoupling the transmitted symbols 
into two groups of the same size. However, STBCs with lower 
decoding complexity may be obtained through the concept of 
multi-group decodability laid by the S. Karmakar et al. in 18J, 



Jo] - Indeed, the multi-group decodability generalizes the quasi- 
orthogonality by allowing more than two groups of symbols 
to be decoupled not necessarily with the same size. 

However, due to the strict rate limitation imposed by the 
multi-group decodability, another family of STBCs namely 
Fast Decodable (FD) STBCs [l] has been proposed. These 
codes are conditionally multi-group decodable thus enabling 
the use of the conditional detection technique lITOl which in 
turn significantly reduces the overall decoding complexity. 
Recently, STBCs that combine the multi-group decodability 
and the fast decodability namely the Fast-Group Decodable 
(FGD) codes have been proposed ifTTI . These codes are multi- 
group-group decodable such that each group of symbols is fast 
decodable. The contributions of this paper are summarized in 
the following: 

• We propose a novel systematic construction of rate-1 
FGD STBCs for 2'^ transmit antennas. The rate-1 FGD 
code for a number of transmit antennas that is not a power 
of two is obtained by removing the appropriate number of 
columns from the rate-1 FGD code corresponding to the 
nearest greater number of antennas that is a power of two 
(e.g. the rate-1 FGD STBC for three transmit antennas 
is obtained by removing a single column from the four 
transmit antennas rate-1 FGD STBC). 

• We apply our new construction method to the case of 
four transmit antennas and show that the resulting new 
4x4 rate-1 code can be decoded at half the worst-case 
decoding complexity of the best known rate-1 STBC. 

• The coding gain of the new 4x4 rate-1 code is optimized 
through constellation stretching 1121 and the NonVanish- 
ing Determinant (NVD) property ||T3| is proven to be 
achieved by properly choosing the stretching factor. 

• We propose a new rate-2 STBC through multiplexing 
two of the new rate-1 codes by the means of a unitary 
matrix and numerical optimization. We then propose a 
significant reduction of the worst-decoding complexity at 
the expense of a rate loss by puncturing the rate-2 code 
to obtain a new rate-3/2 code. 

We compare the proposed codes to existing STBCs in the 
literature and found through numerical simulations that our 
rate-3/2 code has a significantly lower average decoding com- 
plexity while our rate-2 code is decoded with a lower average 
decoding complexity at low SNR region. Performance simu- 
lations show that this reduction in the decoding complexity 
comes at the expense of a small performance loss. 

The rest of the paper is organized as follows: The system 



model is defined and the families of low-complexity STBCs 
are outlined in Section II. In Section III we propose our 
scheme for the rate-1 FGD codes construction for the case 
of 2^* transmit antennas, and then the FGD code construction 
method is applied to the case of four transmit antennas giving 
rise to a new 4x4 rate-1 STBC. In Section IV, the rate 
of the proposed code is increased through multiplexing and 
numerical optimization. Numerical results are provided in 
Section V, and we conclude the paper in Section VI. 

Notations: 

Hereafter, small letters, bold small letters and bold capital 
letters will designate scalars, vectors and matrices, respec- 
tively. If A is a matrix, then A^, and A-'^ denote the hermitian 
and the transpose of A, respectively. We define the vec{.) as 
the operator which, when applied to a to x n matrix, transforms 
it into a ran x 1 vector by simply concatenating vertically the 
columns of the corresponding matrix. The ® operator is the 
Kronecker product and the sign{.) operator returns 1 if its 
scalar input is > and -1 otherwise. The round[.) operator 
rounds its argument to the nearest integer. The (.) operator 
concatenates vertically the real and imaginary parts of its 
argument, (a)^ means a modulo n. 

II. Preliminaries 
We define the MIMO channel input-output relation as: 

Y = X H + (1) 

TxJV,. TxNtNtxNr TxNr 

where T is the number of channel uses, Nr is the number 
of receive antennas, Nt is the number of transmit antennas, 
Y is the received signal matrix, X is the code matrix, H is 
the channel matrix with entries hki ^ CJ\f{Q, 1), and W is 
the noise matrix with entries Wij ^ CJ\f{0,NQ). In the case 
of Linear Dispersion (LD) codes |14|, a STBC that encodes 
2K real symbols is expressed as a linear combination of the 
transmitted symbols as: 



2K 



X 



(2) 



with Xfe e M and the A^ 



fc=i 
,fc= 1,. 



., 2K aie T X Nt complex 
matrices called dispersion or weight matrices that are required 
to be linearly independent over M. The MIMO channel model 
can then be expressed in a useful manner by using ^ as: 



Y = 



2K 

E 

k=l 



(AfcH) xk + W. 



(3) 



vec(Y) 



{In, (g, Afe) vec (H) Xk + vec(W). (4) 



Applying the vec{.) operator to the above equation we obtain: 

2K 

E' 

k=l 

where Ijv^ is the Nr x Nr identity matrix. If y^, and 
designate the i'th column of the received signal matrix Y, the 
channel matrix H and the noise matrix W respectively, then 
equation (|4l) can be written in matrix form as: 



" yi ' 
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Thus we have: 



Us 



(5) 



(6) 



A real system of equations is be obtained by applying 
the (.) operator to the (|6|: 

y = ?is + w (7) 
where y,w e M^w.Txi^ ^j^^j ^ ^ ^2N,tx2K ^ Assuming that 

NrT > K, the QR decomposition of H yields: 

n = [Qi Q2] (8) 

where Qi e R^^'-^^'^^ ,Q2 e r2N,tx(2N,t-2K) ^ qTq^ ^ 
I, j = 1, 2, R is a 2K x 2K real upper triangular matrix and 
is a {2NrT - 2K) x 2K null mati'ix. Accordingly, the ML 
estimate may be expressed as: 

s'^L = ai-g min||y - QiRsip (9) 

where C is the vector space spanned by information vector s. 
Noting that multiplying a column vector by a unitary matrix 
does not alter its norm, the above reduces to: 

= arg min||y' - Rs|p (10) 

where y' = y. 

In the following, we will briefly review the known families 
of low-complexity STBCs and the structures of their corre- 
sponding R matrices that enable a simplified ML detection. 

A. Multi-group decodable codes 

Multi-group decodable STBCs are designed to significantly 
reduce the worst-case decoding complexity by allowing sepa- 
rate detection of disjoint groups of symbols without any loss 
of performance. This is achieved iff the ML metric can be 
expressed as a sum of terms depending on disjoint groups of 
symbols. 

Definition 1. A STBC code that encodes 2K real symbols is 
said to be g-group decodable if its weight matrices are such 
that S^, SEl: 



Af A; + Af Afe = 0, VAfc e g„ A, e g,, 

a 

i<i^j<9, m-- 



i=l 



2K. 



(11) 



where Qi is the set of weight matrices associated to the i 'th 
group of symbols. 

For instance if a STBC that encodes 2K real symbols is g- 
group decodable, its worst-case decoding complexity order 
can be reduced from AI^ to %/a7 ' with M being 

the size of the used square QAM constellation. The worst- 
case decoding complexity order can be further reduced to 
^i=i if the conditional detection with hard slicer is 

employed. In the special case of orthogonal STBCs, the worst- 
case decoding complexity is 0(1) as the PAM slicers need 
only a fixed number of arithmetic operations irrespectively of 
the square QAM constellation size. 

B. Fast decodable codes 

A STBC is said to be fast decodable if it is conditionally 
multi-group decodable. 

Definition 2. A STBC that encodes 2K real symbols is said 
to be FD if its weight matrices are such that: 

A^Ai + Af Afe = 0, VAfe € g„ Ai e G,, 

9 

l<i^i<g, \Q^\ =- n,,Y^ n^ = k< 2K. 

i=l 



(12) 



where Qi is the set of weight matrices associated to the i 'th 
group of symbols. 

In this case the conditional detection may be used to 
significantly reduce the worst-case decoding complexity. 
The first step consists of evaluating the ML estimate of 
{xi,...,Xk) conditioned on a given value of the rest 
of the symbols (zfe+i, . . . , X2k) that we may note by 
[x^^ , . . . ,x^^\xk+iT ■ ■ tX2k)- In the second step, the re- 
ceiver will have to minimize the ML metric only over all the 
possible values of {xk+i, ■ ■ ■ ,X2k)- For instance, if a STBC 
that encodes 2K real symbols is FD, its corresponding worst- 
case decoding complexity order for sc^uare QAM constella- 
tions is reduced from to V M x J2i=i ' ■ If 
the FD code is in fact conditionally orthogonal, the worst-case 

2K—k 

decoding complexity order is reduced to vM 

C. Fast group decodable codes 

A STBC is said to be fast group decodable if it is multi- 
group decodable such that each group is fast decodable. 
Deflnition 3. A STBC that encodes 2K real symbols is said 
to be FGD if its weight matrices are such that: 

Af A, + AfAk = 0, VAfe eg,, Aie Gj, 



^ <i ^ j < 9i \Gi\ = nj,y^nj = 2K 



(13) 



and that the weight matrices within each group are such that: 



A^Ai + AfAk = 0, VAfe e g,,,n.. Ai e G,, 



I < m ^ n < gi, \Gi. 



ai 



ki < fii 



(14) 



where Gi,m (resp. gi) denotes the set of weight matrices that 
constitute the m'th group (resp. the number of inner groups) 
within the i 'th group of symbols Gi- 

For instance, if a STBC that encodes 2K real symbols 
is FGD, its corresponding worst-case decoding complexity 
order for square QAM constellations is reduced from 

to ELin/^"" X EfLi%/^"''' ^ Similarly, if each 
group is conditionally orthogonal, the worst-case decoding 

complexity order is equal to Ef=i "v/A/^ 



III. The proposed FGD scheme 

Let the set {I, Ri, . . . , R2Q+1} denote the weight matrices 
of the square orthogonal STBC for 2" transmit antennas |[T5l . 
The proofs of the following propositions are omitted due to 
space limitations. 

Proposition 1. For 2" transmit antennas, the two sets of 
matrices, namely Gi = {I, R-i, • • • , Ra} U A and G2 = 



{Ra+i, . . . , R2a+i}US satisfy ( |ll| l where A and B are given 

in the Table \I\and and Sig{m) are given in Table Ul\ 

'-' TABLE II ' 

Different cases for <5 



a 


<5y((m) 




in 


((™)4-l)((™)4-2) 


2-(m)4 


4n + 1 


{{("•)4){('»)4-l))4 


(m)4-l 


4n + 2 


{{(".)4){('")4-3))4 


(™)4 


4n + 3 


((")4-2)((™)4-3) 
2 


3-(m)4 
2 



Proposition 2. The rate of the proposed family of FGD codes 
is equal to one complex symbol per channel use. 
The weight matrices of our new FGD construction method for 
four and eight transmit antennas are listed in Table |lll| 

TABLE III ' ' 

Examples of rate-1 FGD codes 



Tx 


Gi 


G2 


4 


I, R.2 , R.4t R.iR,3R.5 


Ri , R3 , R5 , R2 R4 


8 


I , R.2 1 R4 • Re 
jRiRsRsRy 
jUi R3 R5 R7R2 
^Ri R3 R5 R7R4 
JR1R3R5R7R6 


Ri , R3 , R5 , R7 
JR2R4R6R1 

J/R2R4R6R-3 
J/R2R4R6R-5 
JR2R4R6R7 



A new rate-1 FGD STBC for four transmit antennas 



According to Table III the proposed rate-1 STBC in the 
case of four transmit antennas denoted Xi may be expressed 
as: 

Xi(s) =1x1 + R22;2 + R4a;3 + RiR3R52;4+ 

(16) 

Ria;5 + RsXg + Il^xj + R2R4a;8. 
According to Deflnition 3, the proposed code X is a FGD 
STBC with g ~ 2,ni ~ n2 — 4: and gi = ,92 = 3 such 
as fiij — 1, i ~ 1,2, j — 1,2,3. Therefore, the worst-case 
decoding complexity order is 2\fM. However, the coding gain 
of Xi is equal to zero, in order to achieve the full-diversity, 
we resort to the constellation stretching |12J rather than 
the constellation rotation technique, otherwise the orthogonal 
symbols inside each group will be entangled together which 
in turns will destroy the FGD structure of the proposed code 
and causes a significant increase in the decoding complexity. 

The full diversity code matrix takes the form of ( [T7] l where 
s = [xi, . . . , Xg] and k is chosen to provide a high coding gain. 
The term is added to normalize the average transmitted 

power per antenna per time slot. 

Proposition 3. Taking k ~ ensures the NVD property 
for the proposed code with a coding gain equal to 1. 

IV. The proposed rate-2 code 

The proposed rate-2 code denoted X2 is simply obtained by 
multiplexing two rate-1 codes by means of a unitary matrix. 
Mathematically speaking, the rate-2 STBC is expressed as: 

X2 [xi, . . ■,xie) = Xi (a;i, . . . ,xs) + e^'^'Xi {xg, . . ■,Xie) U 



TABLE I 
Different cases for A and B 







B 


4n 


{3 n?°i+i Ri} ^U^j Ui^til 1 Rfc; : 1 < fcl < . . - < fc™ < o} 


{jnf=i Ri} ^{i^'^''"^^ n?=i Ri n™ 1 R^. : a + 1 < < . . . < fe,„ < 2a + 1} 


in + 1 


{n?"J+i R,} {j-^ac™) n-"iii Ri n£i r^^ : 1 < fei < ■ - ■ < fc™ < 


^^13 {j"'''''"' n°=i Ri n™ 1 Rfc. : a + 1 < fci < . . . < fc„ < 2a + 1} 


4n + 2 


{li!lt+l yj^i {/.AC™) n-°i+i Ri U^i Rfc^ : 1 < fel < . - . < fc™ < a} 


{n?=i Ri} {j"'''*'"' n"=i RiH" 1 Rfc. : a + 1 < fci < . . - < fcm < 2a + 1} 


4n + 3 


{mfZtli Ri} ^U^j {j'^'^'n^^ili RiH^iRfci : 1 < fci < . . - < fc,„ < a} 


JiJ^3 {j-^BC™) n°^i Ri n" 1 Rfc; : a + 1 < fci < . . . < fc,„ < 2a + 1} 



where U and (f) are chosen in order to maximize the coding 
gain. It was numerically verified for QPSK constellation that 
taking U — j'Ri and 0opt = tan^^ (i) maximizes the coding 
gain which is equal to 1. To decode the proposed code, the 
receiver evaluates the QR decomposition of the real equivalent 
channel matrix "H (|7]). The corresponding upper-triangular 
matrix R takes the form: 

A B 
C 

where B e M*^® has no special structure, C G R^^^ 



TABLE rV 

SUMMARY OF COMPARISON IN TERMS OF WORST-CASE DECODING 
COMPLEXITY 



R 



Code 


Square QAM 
decoding complexity 


The proposed rate- 3/2 code 




The punctured P.Srinath-S.Rajan rate-3/2 code 16 




The S.Sirianunpiboon et a!, code 17 


M" 


The proposed rate-2 code 




The rate-2 P.Srinath-S.Rajan code 116| 





^^^^ simplified Schnorr-Euchner enumeration ifTSl . From Fig. [T] 
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(19) 



in which x indicates a possible non-zero position. For each 
value of {xg, . . . ,xiq), the decoder scans independently all 
possible values of X4 and xs, and assigns to them the cor- 
responding 6 ML estimates of the rest of symbols via hard 
slicers according to (20)-(|2T]l. where: 




16 



- ri^^Xi 



'^r^^kikj/rui, « = 1,2, 3 



-a — Proposed rate-3/2 code (QPSK) 

■ a - P.Srinath-S.Rajan rate-3/2 code (QPSK) 

■ a - S.Sirianunpiboon ef a/, code (QPSK) 
-V — Proposed rate-3/2 code (16-QAIVI) 

■ V - P.Srinath-S.Rajan rate-3/2 code (16-QAIVI) 

■ V - S.Sirianunpiboon eta/, code (16-QAM) 

5 to 15 

SNR per receive antenna 



Fig. 1. BER peii'oiTnance for 4x2 configuration 



k=9 
16 

fe=9 

will denote 



ID 



5,6,7 



'-3/2 



may be easily 



A rate-3/2 code that we 

obtained by puncturing the rate-2 proposed code X2 and may 
be expressed as: 

fa;i , X12) = Xi fxi Xf^)+e^'^°^'X^ (xg, . . . ,2:12) U 



(a;i, . . . , X12) = Xi (xi, . . . , a;8)-l-e^'^°'"Xi (xg, . 
V. Numerical and simulation results 



In this section, we compare our proposed codes to com- 
parable low-complexity STBCs existing in the literature in 
terms of worst-case decoding complexity, average decoding 
complexity and Bit Error Rate (BER) performance over quasi- 
static Rayleigh fading channels. One can notice from Table IV 
that the worst-case decoding complexity of the proposed rate- 
3/2 code is half that of the punctured rate-3/2 code in fT^l 
and is reduced by a factor of w.r.t to the worst-case 

decoding complexity of the rate-3/2 code in ifTTl . Moreover, 
the worst-case decoding complexity of our rate-2 code is half 
that of the code in |16|. Simulations are carried out in a 
quasi-static Rayleigh fading channel in the presence of AWGN 
and 2 receive antennas for our rate-3/2 and rate-2 codes. The 
ML detection is performed via a depth-first tree traversal with 
infinite initial radius SD. The radius is updated whenever a leaf 
node is reached and sibling nodes are visited according to the 




-H — Proposed rate-3/2 code (QPSK) 
a - P.Srinath-S.Rajan rate-3/2 code (QPSK) 
a - S.Sirianunpiboon ef a/, code (QPSK) 

-V — Proposed rate-3/2 code (16-QAIVI) 

V - P.Srinath-S.Rajan rate-3/2 code (16-QAM) 

V - S.Sirianunpiboon ef a/, code (16-QAM) 



to 15 
SNR per receive antenna 



Fig. 2. Average complexity for 4x2 configuration 

one can notice that the proposed rate-3/2 code gains about 
0.4 dB w.rt to S.Sirianunpiboon et al. code ifTTll while it loses 
about 0.5 dB w.rt to the punctured RSrinath-S.Rajan code fTSl 
at 10^"^ BER. From Fig. |2j it can be noticed that the average 
complexity of our rate-3/2 code is significantly less than that of 
the punctured RSrinath-S.Rajan code lfT6l and roughly equal 
to that of S.Sirianunpiboon et al. code ifTTI . From Fig. [3] one 
can notice that the proposed rate-2 code loses about 0.8 dB 
w.r.t the P.Srinath-S.Rajan code fW\ at 10~^ BER. However, 
from Fig. |4] one can notice that our proposed code maintains 



X1(S): 



1-1- A:2 



x\ + ikx5 X2 + ikxQ 

—X2 + ikxn xi — ikxs 

~x:j + ikxr ikx4 + xg 

ikx4 + xs X3 — ikx-j 



X3 -|- ikx-j —ikx4 — xs 

—ikx4, — Xs —X3 — ikx7 

xi — ikxs X2 + ikxQ 

—X2 + ikx6 x\ + ikx5 



(17) 



f^l (£4,^9, . . • ,£l6) = (zi) X min[|2 roimc/( (zi - 1) /2) + l|, \/M - ij , i = 1, 2,3 (20) 

I (X8,xg, . . . ,£i6) = sign{zj) X mm^\2 round{{zj - 1) /2) + 1\,VM - 1^, j = 5,6,7 (21) 



X 




-B — Proposed rate-2 oode (QPSK) 
■ B - P.Srinath-S.Rajan code (QPSK) 



5 10 
SNR per reoeive antenna 



Fig. 3. BER performance for 4x2 configuration 




5 10 
SNR per receive antenna 

Fig. 4. Average Complexity for 4x2 configuration 

its lower average decoding complexity in the low SNR region. 

VI. Conclusion 

In the present paper we have proposed a systematic ap- 
proach for the construction of rate-1 FGD codes for an 
arbitrary number of transmit antennas. This approach when 
applied to the special case of four transmit antennas results in 
a new rate-1 FGD STBC that has the smallest worst-case de- 
coding complexity among existing comparable low-complexity 
STBCs. The coding gain of the proposed FGD rate-1 code 
was then optimized through constellation stretching. Next we 
managed to increase the rate to 2 by multiplexing two rate- 
1 codes through a unitary matrix. A compromise between 
complexity and throughput may be achieved through punc- 
turing the proposed rate-2 code which results in a new low- 
complexity rate-3/2 code. The worst-case decoding complexity 
of the proposed codes is lower than their STBC counterparts in 
the literature. Simulations results show that the proposed rate- 
3/2 code offers better performance that the S.Sirianunpiboon 



et al. code ifTTl but loses about 0.5 dB w.r.t the punctured 
P.Srinath-S.Rajan code |16| at lO^^^ BER. The proposed rate- 
2 code loses about 0.8 dB w.rt the RSrinath-S.Rajan code 
lfT6l at lO^'' BER. In terms of average decoding complexity, 
we found that the proposed rate-3/2 code has a lower average 
decoding complexity while the proposed rate-2 code maintains 
its lower average decoding complexity in the low SNR region. 
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